"multilingual" Deep Neural Network for Music Genre Classification
نویسندگان
چکیده
Multilingual deep neural network (DNN) has been widely used in low-resource automatic speech recognition (ASR) in order to balance the rich-resource and low-resource speech recognition or to build the low-resource ASR system quickly. Inspired by the idea of using multilingual DNN for ASR, we use a “multilingual” DNN (Multi-DNN) for music genre classification. However, we do not have “multilingual” in music, so we use the similar resource instead. In order to obtain the similar resource corresponding to small target database, the nearest neighbor (NN) algorithm is used to relabel the large similar database. Then the re-labeled large similar database is used to train a Multi-DNN, and the small target database is used to further adapt the trained Multi-DNN. By using the Multi-DNN approach, the DNN can be well trained, and be transferred to the small target database quickly. The experiments are evaluated on the benchmark databases, ISMIR database and GTZAN database, which are used as the large similar database and small target database respectively. The experiment results show that the proposed method can achieve 93.4% (10-fold cross-validation) average classification accuracy on GTZAN database, which outperforms the state-ofthe-art best performance on this database.
منابع مشابه
Music Genre Classification with Paralleling Recurrent Convolutional Neural Network
Deep learning has been demonstrated its effectiveness and efficiency in music genre classification. However, the existing achievements still have several shortcomings which impair the performance of this classification task. In this paper, we propose a hybrid architecture which consists of the paralleling CNN and Bi-RNN blocks. They focus on spatial features and temporal frame orders extraction...
متن کاملComparing Shallow versus Deep Neural Network Architectures for Automatic Music Genre Classification
In this paper we investigate performance differences of different neural network architectures on the task of automatic music genre classification. Comparative evaluations on four well known datasets of different sizes were performed including the application of two audio data augmentation methods. The results show that shallow network architectures are better suited for small datasets than dee...
متن کاملLearning Temporal Features Using a Deep Neural Network and its Application to Music Genre Classification
In this paper, we describe a framework for temporal feature learning from audio with a deep neural network, and apply it to music genre classification. To this end, we revisit the conventional spectral feature learning framework, and reformulate it in the cepstral modulation spectrum domain, which has been successfully used in many speech and music-related applications for temporal feature extr...
متن کاملImproved Music Genre Classification with Convolutional Neural Networks
In recent years, deep neural networks have been shown to be effective in many classification tasks, including music genre classification. In this paper, we proposed two ways to improve music genre classification with convolutional neural networks: 1) combining maxand averagepooling to provide more statistical information to higher level neural networks; 2) using shortcut connections to skip one...
متن کاملDeep Image Features in Music Information Retrieval
Applications of Convolutional Neural Networks (CNNs) to various problems have been the subject of a number of recent studies ranging from image classification and object detection to scene parsing, segmentation 3D volumetric images and action recognition in videos. CNNs are able to learn input data representation, instead of using fixed engineered features. In this study, the image model traine...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015